Bidirectional Best Hits Miss Many Orthologs in Duplication-Rich Clades such as Plants and Animals
نویسندگان
چکیده
Bidirectional best hits (BBH), which entails identifying the pairs of genes in two different genomes that are more similar to each other than either is to any other gene in the other genome, is a simple and widely used method to infer orthology. A recent study has analyzed the link between BBH and orthology in bacteria and archaea and concluded that, given the very high consistency in BBH they observed among triplets of neighboring genes, a high proportion of BBH are likely to be bona fide orthologs. However, limited by their analysis setup, the previous study could not easily test the reverse question: which proportion of orthologs are BBH? In this follow-up study, we consider this question in theory and answer it based on conceptual arguments, simulated data, and real biological data from all three domains of life. Our analyses corroborate the findings of the previous study, but also show that because of the high rate of gene duplication in plants and animals, as much as 60% of orthologous relations are missed by the BBH criterion.
منابع مشابه
Evolutionary Analysis of MIKCc-Type MADS-Box Genes in Gymnosperms and Angiosperms
MIKCc-type MADS-box genes encode transcription factors that control floral organ morphogenesis and flowering time in flowering plants. Here, in order to determine when the subfamilies of MIKCc originated and their early evolutionary trajectory, we sampled and analyzed the genomes and large-scale transcriptomes representing all the orders of gymnosperms and basal angiosperms. Through phylogeneti...
متن کاملUnified modeling of gene duplication, loss, and coalescence using a locus tree.
Gene phylogenies provide a rich source of information about the way evolution shapes genomes, populations, and phenotypes. In addition to substitutions, evolutionary events such as gene duplication and loss (as well as horizontal transfer) play a major role in gene evolution, and many phylogenetic models have been developed in order to reconstruct and study these events. However, these models t...
متن کاملChoosing BLAST options for better detection of orthologs as reciprocal best hits
MOTIVATION The analyses of the increasing number of genome sequences requires shortcuts for the detection of orthologs, such as Reciprocal Best Hits (RBH), where orthologs are assumed if two genes each in a different genome find each other as the best hit in the other genome. Two BLAST options seem to affect alignment scores the most, and thus the choice of a best hit: the filtering of low info...
متن کاملGene loss and evolutionary rates following whole-genome duplication in teleost fishes.
Teleost fishes provide the first unambiguous support for ancient whole-genome duplication in an animal lineage. Studies in yeast or plants have shown that the effects of such duplications can be mediated by a complex pattern of gene retention and changes in evolutionary pressure. To explore such patterns in fishes, we have determined by phylogenetic analysis the evolutionary origin of 675 Tetra...
متن کاملThe Evolutionary History of Sarco(endo)plasmic Calcium ATPase (SERCA)
Investigating the phylogenetic relationships within physiologically essential gene families across a broad range of taxa can reveal the key gene duplication events underlying their family expansion and is thus important to functional genomics studies. P-Type II ATPases represent a large family of ATP powered transporters that move ions across cellular membranes and includes Na(+)/K(+) transport...
متن کامل